Track & Field
GAIA: Rethinking Action Quality Assessment for AI-Generated Videos
Assessing action quality is both imperative and challenging due to its significant impact on the quality of AI-generated videos, further complicated by the inherently ambiguous nature of actions within AI-generated video (AIGV). Current action quality assessment (AQA) algorithms predominantly focus on actions from specific real-world scenarios and are pre-trained on normative action features, rendering them inapplicable to AIGVs. To address these problems, we construct GAIA, a Generic AI-generated Action dataset, by conducting a large-scale subjective evaluation from a novel causal reasoning-based perspective, resulting in 971,244 ratings among 9,180 video-action pairs. Based on GAIA, we evaluate a suite of popular text-to-video (T2V) models on their ability to generate visually rational actions, revealing their strengths and weaknesses across different categories of actions. We also extend GAIA as a testbed to benchmark the AQA capacity of existing automatic evaluation methods. Results show that traditional AQA methods, action-related metrics in recent T2V benchmarks, and mainstream video quality methods perform poorly, with average SRCCs of 0.454, 0.191, and 0.519, respectively, indicating a sizable gap between current models and human action perception patterns in AIGVs. Our findings underscore the significance of action quality as a unique perspective for studying AIGVs and can catalyze progress towards methods with enhanced AQA capacities for AIGVs.
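For context, the SRCC figures quoted above measure rank agreement between automatic scores and human ratings. Below is a minimal sketch of that computation using SciPy; the arrays are illustrative placeholders, not values from GAIA.

```python
# Minimal sketch (not GAIA benchmark code): Spearman rank correlation (SRCC)
# between model-predicted action-quality scores and human mean opinion scores.
from scipy.stats import spearmanr

human_mos = [4.2, 1.8, 3.5, 2.9, 4.7]          # hypothetical subjective ratings
model_scores = [0.81, 0.33, 0.60, 0.55, 0.90]  # hypothetical model outputs

srcc, p_value = spearmanr(human_mos, model_scores)
print(f"SRCC = {srcc:.3f} (p = {p_value:.3g})")
```

An SRCC near 1 would indicate the automatic method ranks videos the same way human raters do; the averages reported above (0.191-0.519) suggest current methods are far from that.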
Continual Learning for Multiple Modalities
Continual learning aims to learn knowledge of tasks observed in sequential time steps while mitigating the forgetting of previously learned knowledge. Existing methods were proposed under the assumption of learning a single modality (e.g., image) over time, which limits their applicability in scenarios involving multiple modalities. In this work, we propose a novel continual learning framework that accommodates multiple modalities (image, video, audio, depth, and text). We train a model to align various modalities with text, leveraging its rich semantic information. However, this increases the risk of forgetting previously learned knowledge, exacerbated by the differing input traits of each task. To alleviate the overwriting of previously learned modality knowledge, we propose a method for aggregating knowledge within and across modalities. The aggregated knowledge is obtained by assimilating new information through self-regularization within each modality and by associating knowledge between modalities, prioritizing contributions from relevant modalities. Furthermore, we propose a strategy that re-aligns the embeddings of modalities to resolve biased alignment between modalities. We evaluate the proposed method in a wide range of continual learning scenarios using multiple datasets with different modalities. Extensive experiments demonstrate that our method outperforms existing approaches in these scenarios, regardless of whether the identity of the modality is given.
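To make the modality-to-text alignment idea concrete, here is a minimal sketch of a symmetric contrastive objective that pulls each modality's embeddings toward their paired text embeddings. This is a generic recipe for modality-text alignment, not the paper's actual objective, aggregation mechanism, or regularization scheme.

```python
# Illustrative sketch: symmetric InfoNCE-style loss aligning one modality's
# embeddings with paired text embeddings. Names and hyperparameters are
# assumptions for illustration only.
import torch
import torch.nn.functional as F

def alignment_loss(modality_emb, text_emb, temperature=0.07):
    """modality_emb, text_emb: (batch, dim) tensors of paired embeddings."""
    m = F.normalize(modality_emb, dim=-1)
    t = F.normalize(text_emb, dim=-1)
    logits = m @ t.T / temperature               # pairwise cosine similarities
    targets = torch.arange(m.size(0), device=m.device)
    # Match each modality sample to its own caption, and vice versa.
    return (F.cross_entropy(logits, targets) +
            F.cross_entropy(logits.T, targets)) / 2
```

In a continual setting, losses of this form would be computed per task as new modalities arrive, which is exactly where the forgetting risk the abstract describes comes in.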
1 Details for Dataset Partitioning
In this section, we present the comparison of Meta-Adapter and other methods on the remaining seven datasets under different few-shot settings in Table 1. We also compare Meta-Adapter with the SOTA prompt-learning method, CoCoOp [9], in Figure 1. All experiments are conducted under the 16-shot setting. It is clear that Meta-Adapter demonstrates superior generalizability over CoCoOp by large margins.
Uncovering the Hidden Dynamics of Video Self-supervised Learning under Distribution Shifts
Ahmad Beirami, Vector Institute
Video self-supervised learning (VSSL) has made significant progress in recent years. However, the exact behavior and dynamics of these models under different forms of distribution shift are not yet known. In this paper, we comprehensively study the behavior of six popular self-supervised methods (v-SimCLR, v-MoCo, v-BYOL, v-SimSiam, v-DINO, v-MAE) in response to various forms of natural distribution shift, i.e., (i) context shift, (ii) viewpoint shift, (iii) actor shift, (iv) source shift, (v) generalizability to unknown classes (zero-shot), and (vi) open-set recognition. To perform this extensive study, we carefully craft a test bed consisting of 17 in-distribution and out-of-distribution benchmark pairs using available public datasets and a series of evaluation protocols to stress-test the different methods under the intended shifts.
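As an illustration of the general protocol behind such in-distribution/out-of-distribution comparisons (not the paper's actual test bed), the sketch below evaluates a frozen video encoder with a linear probe on both splits; `encoder`, `probe`, and the data loaders are hypothetical stand-ins.

```python
# Generic ID/OOD probe evaluation sketch for a frozen self-supervised backbone.
# All objects here are placeholders, not the paper's benchmarks or protocols.
import torch

@torch.no_grad()
def accuracy(probe, encoder, loader, device="cpu"):
    correct = total = 0
    for clips, labels in loader:
        feats = encoder(clips.to(device))   # frozen VSSL features
        preds = probe(feats).argmax(dim=-1)
        correct += (preds == labels.to(device)).sum().item()
        total += labels.numel()
    return correct / total

# Usage (hypothetical): train `probe` on in-distribution features, then compare
# accuracy(probe, encoder, id_loader) against accuracy(probe, encoder, ood_loader)
# to quantify how much a given shift (context, viewpoint, actor, ...) hurts.
```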
Real-time Monitoring and Analysis of Track and Field Athletes Based on Edge Computing and Deep Reinforcement Learning Algorithm
Xiaowei Tang, Bin Long, Li Zhou
As a fundamental sports discipline, track and field not only forms the core of major events like the Olympics and World Championships but also plays a crucial role in promoting public health Jacobsson, Ekberg, Timpka, Haggren Råsberg, Sjöberg, Mirkovic and Nilsson (2020); Timpka, Dahlström, Fagher, Adami, Andersson, Jacobsson, Svedin and Bermon (2022). The wide variety of track and field events, including sprints, middle and long-distance running, jumps, and throws, demands high levels of physical fitness, technical skills, and mental strength from athletes Guo (2022); Zhang et al. (2023a). To excel in such competitive environments, athletes require not only innate talent and dedication but also scientific and systematic training methods Zhang et al. (2023b); Yuan et al. (2024).

In recent years, real-time monitoring and data analysis have become increasingly critical in enhancing athletic performance. Studies have shown that by monitoring physiological indicators (such as heart rate, body temperature, and blood oxygen saturation) and performance metrics (such as speed, acceleration, and force) in real-time, it is possible to identify problems during training promptly and make targeted adjustments. For example, analyzing heart rate changes under different training intensities can assess endurance levels and recovery status, while monitoring gait and acceleration during running can optimize technical movements and improve efficiency Rana and Mittal (2020a). Many studies have begun exploring the potential of using sensor technology and data
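As a toy illustration of the kind of real-time check described above (not the paper's system), the sketch below flags heart-rate samples that drift from a rolling baseline; the window size, tolerance, and class name are invented for illustration, and a real system would calibrate them per athlete.

```python
# Toy streaming check: flag heart-rate readings that deviate from a rolling
# baseline, as an edge device might during training. Thresholds are invented.
from collections import deque

class HeartRateMonitor:
    def __init__(self, window=30, tolerance=25):
        self.history = deque(maxlen=window)  # recent beats-per-minute samples
        self.tolerance = tolerance           # allowed deviation from baseline

    def update(self, bpm):
        """Ingest one reading; return an alert string if it looks anomalous."""
        alert = None
        if len(self.history) == self.history.maxlen:
            baseline = sum(self.history) / len(self.history)
            if abs(bpm - baseline) > self.tolerance:
                alert = f"bpm {bpm} deviates from baseline {baseline:.0f}"
        self.history.append(bpm)
        return alert
```

Running such logic on an edge node rather than a remote server is what keeps the feedback loop fast enough to act on during a session.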
How AI taught Cassie the two-legged robot to run and jump
Researchers used an AI technique called reinforcement learning to help a two-legged robot nicknamed Cassie to run 400 meters, over varying terrains, and execute standing long jumps and high jumps, without being trained explicitly on each movement. Reinforcement learning works by rewarding or penalizing an AI as it tries to carry out an objective. In this case, the approach taught the robot to generalize and respond in new scenarios, instead of freezing like its predecessors may have done. "We wanted to push the limits of robot agility," says Zhongyu Li, a PhD student at University of California, Berkeley, who worked on the project, which has not yet been peer-reviewed. "The high-level goal was to teach the robot to learn how to do all kinds of dynamic motions the way a human does."
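For readers unfamiliar with the reward loop the article describes, the sketch below shows the bare agent-environment interaction cycle using Gymnasium's Pendulum-v1 as a stand-in task. This is not Cassie's training code; a learned locomotion policy would replace the random action.

```python
# Bare-bones reinforcement-learning interaction loop: the agent acts, the
# environment returns a reward (or penalty), and a learner would update the
# policy from that signal. Pendulum-v1 stands in for a locomotion task.
import gymnasium as gym

env = gym.make("Pendulum-v1")
obs, info = env.reset()
for _ in range(1000):
    action = env.action_space.sample()  # a trained policy would choose here
    obs, reward, terminated, truncated, info = env.step(action)
    # A learning algorithm would use `reward` to reinforce or discourage
    # the action just taken; that trial-and-error is what let Cassie's
    # controllers generalize to unseen terrain instead of freezing.
    if terminated or truncated:
        obs, info = env.reset()
env.close()
```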